11.8K
Publications
758.4K
Citations
19.7K
Authors
3.4K
Institutions
Probabilistic Corpus Distribution Modeling
1975 - 1981
During the period, corpus analysis centered on probabilistic modeling to understand word usage and term discovery across large text collections. Researchers advanced automatic keyword indexing by employing word-distribution and Poisson-based models to separate specialty terms from general vocabulary, shaping core applications in corpus-based information retrieval. A parallel thread examined discourse variation and distributional properties of form frequencies, employing probabilistic indices and sampling-aware methods to assess reliability across corpora.
Popular Keywords
No papers available
Representativeness-Driven Corpus Analytics
1982 - 1998
Corpus-driven Statistical NLP
1999 - 2005
Cross-Task Distributional Corpus Semantics (2006-2010)
2006 - 2010
Platform-Driven Corpus Analytics
2011 - 2017
Cross-Lingual Multimodal Embeddings
2018 - 2024